46 research outputs found

OpenChat: Advancing Open-source Language Models with Mixed-Quality Data

Author: Cheng Sijie
Li Xiangang
Liu Yang
Song Sen
Wang Guan
Zhan Xianyuan
Publication date: 20/09/2023

Nowadays, open-source large language models like LLaMA have emerged. Recent developments have incorporated supervised fine-tuning (SFT) and reinforcement learning fine-tuning (RLFT) to align these models with human goals. However, SFT methods treat all training data with mixed quality equally, while RLFT methods require high-quality pairwise or ranking-based preference data. In this study, we present a novel framework, named OpenChat, to advance open-source language models with mixed-quality data. Specifically, we consider the general SFT training data, consisting of a small amount of expert data mixed with a large proportion of sub-optimal data, without any preference labels. We propose the C(onditioned)-RLFT, which regards different data sources as coarse-grained reward labels and learns a class-conditioned policy to leverage complementary data quality information. Interestingly, the optimal policy in C-RLFT can be easily solved through single-stage, RL-free supervised learning, which is lightweight and avoids costly human preference labeling. Through extensive experiments on three standard benchmarks, our openchat-13b fine-tuned with C-RLFT achieves the highest average performance among all 13B open-source language models. Moreover, we use AGIEval to validate the model generalization performance, in which only openchat-13b surpasses the base model. Finally, we conduct a series of analyses to shed light on the effectiveness and robustness of OpenChat. Our code, data, and models are publicly available at https://github.com/imoneoi/openchat

arXiv.org e-Print Archive

Coherent phrase model for efficient image near-duplicate retrieval

Author: CHENG Xiangang
CHIA Liang-Tien
HU Yiqun
RAJAN Deepu
TAN Ah-hwee
XIE Xing
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/12/2009

Institutional Knowledge at Singapore Management University

Porous single crystalline-like titanium dioxide monolith with enhanced photoelectrochemical performance

Author: Changtao Wang
Fangyuan Cheng
Kaipeng Liu
Kui Xie
Ling Liu
Xiangang Luo
Yunfei Luo
Publication venue: Frontiers Media SA
Publication date: 01/04/2023

Macro-sized porous single crystalline-like (PSC-like) TiO2 is endowed with unique structural advantages due to its structural consistency and porosity in a large area, which would significantly enhance its photoelectrochemical function. However, there are significant technical challenges in the growth of porous single crystalline-like monoliths. The consistency of structure dominates the structure so that the grain boundary is reduced to the minimum, which is in contradiction with the three-dimensional percolation structure. Here we report a lattice reconstruction strategy based on solid-solid transformation to grow porous single crystal-like anatase TiO2 dominated by (200) and (101) facets at 2 cm scale. In comparison with the traditional definition of porous single crystal, it has two different lattice orientations, but still has good photoelectrochemical properties. The band gap engineering introduces Ti3+ gap into the lattice to generate TinO2n−1 with Magnéli phase, limiting the created active structure to the lattice with two-dimensional surface, which would open a new avenue to create highly active surfaces to capture photons and transport electrons stably. The PSC-like TinO2n−1 provides enhanced exciton lifetime (3–5 ns) as a photocatalyst and shows significant visible light absorption. The independent PSC-like TinO2n−1 delivers high photocurrent of 1.8–5.5 mA·cm−2 at room temperature and does not decay for 10 h.

Directory of Open Access Journals

A genetic variation map for chicken with 2.8 million single-nucleotide polymorphisms

Author: Aerts Andrea
Andersson Björn
Andersson Leif
Bartley Neil
Boardman Paul E
Bovenhuis Henk
Brandström Mikael
Bumstead Nat
Burt David W
Chen Chen
Chen Jie
Cheng Hans H
International Chicken Polymorphism Map Consortium
Crooijmans Richard P M A
Dai Mingtao
de Koning Dirk-Jan
Dong Le
Dong Wei
Ellegren Hans
Glavina Tijana
Gordon Laurie
Groenen Martien A M
Gunnarsson Ulrika
Hao Bailin
He Dandan
He Ximiao
Hillier Ladeana W
Hocking Paul M
Hu Songnian
Huang Xiangang
Huang Yanqing
Hubbard Simon J
Hunt Henry
Kaiser Pete
Kaufman Jim
Kindlund Ellen
Lamont Susan J
Lan Fengdi
Law Andy
Li Dawei
Li Guangyuan
Li Guoqing
Li Heng
Li Jun
Li Ning
Li Ruiqiang
Li Shengting
Li Songgang
Li Wenjie
Li Yuanzhe
Lin Wei
Liu Bin
Lucas Susan
Meng Qingshun
Morrice David
Ni Peixiang
Ovcharenko Ivan
Overton Ian M
Ponting Chris
Qi Qiuhui
Ran Longhua
Rogers Sally
Rothwell Lisa
Ruan Jue
Shi Jianping
Stubbs Lisa
Sun Yongqiao
Tammi Martti T
Tang Haizhou
Tong Wei
van der Poel Jan J
van Hateren Andy
Wahlberg Per
Walker Brian A
Wang Jian
Wang Jianjun
Wang Jing
Wang Jun
Wang Miaoheng
Wang Pei
Wang Xiaoling
Warren Wesley C
Webber Caleb
Wei Dong Qing
Wei Ning
Wilson Richard K
Wilson Stuart A
Wong Gane Ka-Shu
Xi Yan
Xie Fei
Yang Huanming
Yang Ning
Yang Shiaw-Pyng
Yang Xu
Yang Zheng
Ye Chen
Ye Jia
Young John R
Yu Jun
Yu Yingpu
Zeng Changqing
Zhang Jianguo
Zhang Jingjing
Zhang Xiaowei
Zhang Yunze
Zhang Zengjin
Zhang Zhenpeng
Zhang Zhi-Yong
Zhao Wenming
Zhao Yiqiang
Zheng Hongkun
Zheng Weimou
Zhou Huaijun
Zhou Jun
Zhou Yan
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2004

We describe a genetic variation map for the chicken genome containing 2.8 million single-nucleotide polymorphisms (SNPs). This map is based on a comparison of the sequences of three domestic chicken breeds (a broiler, a layer and a Chinese silkie) with that of their wild ancestor, red jungle fowl. Subsequent experiments indicate that at least 90% of the variant sites are true SNPs, and at least 70% are common SNPs that segregate in many domestic breeds. Mean nucleotide diversity is about five SNPs per kilobase for almost every possible comparison between red jungle fowl and domestic lines, between two different domestic lines, and within domestic lines - in contrast to the notion that domestic animals are highly inbred relative to their wild ancestors. In fact, most of the SNPs originated before domestication, and there is little evidence of selective sweeps for adaptive alleles on length scales greater than 100 kilobases.

Queen's University Belfast Research Portal

Edinburgh Research Explorer

Wageningen University & Research Publications

University of Gloucestershire Research Repository

The University of Manchester - Institutional Repository

University of Queensland eSpace

Digital Repository @ Iowa State University (ISU)

Online Research @ Cardiff

Oxford University Research Archive